NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ORION and the Three Rights: Sizing, Bundling, and Prewarming for Serverless DAGs

Mahgoub, Ashraf; Yi, Edgardo Barsallo; Shankar, Karthick; Elnikety, Sameh; Chaterji, Somali; Bagchi, Saurabh (July 2022, 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22))

Serverless applications represented as DAGs have been growing in popularity. For many of these applications, it would be useful to estimate the end-to-end (E2E) latency and to allocate resources to individual functions so as to meet probabilistic guarantees for the E2E latency. This goal has not been met till now due to three fundamental challenges. The ﬁrst is the high variability and correlation in the execution time of individual functions, the second is the skew in execution times of the parallel invocations, and the third is the incidence of cold starts. In this paper, we introduce ORION to achieve these goals. We ﬁrst analyze traces from a production FaaS infrastructure to identify three characteristics of serverless DAGs. We use these to motivate and design three features. The ﬁrst is a performance model that accounts for runtime variabilities and dependencies among functions in a DAG. The second is a method for co-locating multiple parallel invocations within a single VM thus mitigating content-based skew among these invocations. The third is a method for pre-warming VMs for subsequent functions in a DAG with the right look-ahead time. We integrate these three innovations and evaluate ORION on AWS Lambda with three serverless DAG applications. Our evaluation shows that compared to three competing approaches, ORION achieves up to 90% lower P95 latency without increasing $ cost, or up to 53% lower $ cost without increasing tail latency.
more » « less
Full Text Available
WISEFUSE: Workload Characterization and DAG Transformation for Serverless Workflows

https://doi.org/10.1145/3489048.3530959

Mahgoub, Ashraf; Yi, Edgardo Barsallo; Shankar, Karthick; Minocha, Eshaan; Elnikety, Sameh; Bagchi, Saurabh; Chaterji, Somali (June 2022, ACM SIGMETRICS)

Full Text Available
Vulcan: a state-aware fuzzing tool for wear OS ecosystem

https://doi.org/10.1145/3386901.3397492

Yi, Edgardo Barsallo; Zhang, Heng; Maji, Amiya K.; Bagchi, Saurabh (June 2020, MobiSys '20: Proceedings of the 18th International Conference on Mobile Systems, Applications, and Services)

Full Text Available
Vulcan: lessons on reliability of wearables through state-aware fuzzing

https://doi.org/10.1145/3386901.3388916

Yi, Edgardo Barsallo; Zhang, Heng; Maji, Amiya K.; Xu, Kefan; Bagchi, Saurabh (June 2020, Proceedings of the 18th International Conference on Mobile Systems, Applications, and Services)
null (Ed.)
Full Text Available

Search for: All records